Energies of the first row atoms from quantum Monte Carlo 
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All-electron variational and diffusion quantum Monte Carlo calculations of the ground state en- 
ergies of the first row atoms (Li to Ne) are reported. We use trial wavefunctions of four types: 
single determinant Slater- Jastrow wavefunctions; multi-determinant Slater- Jastrow wavefunctions; 
single determinant Slater-Jastrow wavefunctions with backflow transformations; multi-determinant 
Slater-Jastrow wavefunctions with backflow transformations. At the diffusion quantum Monte Carlo 
level and using our best trial wavefunctions we recover 99% or more of the correlation energy for 
Li, Be, B, C, N, and Ne, 97% for O, and 98% for F. 

PACS numbers: 



I. INTRODUCTION 

Quantum Monte Carlo (QMC) methods can yield 
highly accurate energies for atoms, molecules and 
solids 1 ' 2 , offering an alternative to quantum chemistry 
approaches such as Configuration Interaction and Cou- 
pled Cluster—. Although the cost of QMC calculations is 
large, the scaling with system size is much better than 
for accurate quantum chemistry approaches, and QMC 
methods have been applied to systems with of order 10 3 
electrons^. 

The two most commonly used continuum QMC meth- 
ods are variational quantum Monte Carlo (VMC) and 
diffusion quantum Monte Carlo (DMC). Stochastic in- 
tegration is used in VMC to evaluate expectation values 
with a trial wavefunction, to a precision which can be sys- 
tematically improved by increasing the number of config- 
urations sampled. DMC is a stochastic projector method 
in which a wavefunction is evolved in imaginary time to- 
wards the ground state. Exact projector methods suffer 
from a fermion sign problem, and to overcome this we 
use the standard fixed-node approximation^ in which 
the nodal surface of the wavefunction is fixed during the 
evolution. The fixed-node approximation is uncontrolled, 
but the DMC energy is lower than the VMC energy with 
the same input trial wavefunction, and both give upper 
bounds on the exact energy. 

DMC gives the lowest energy consistent with the fixed 
nodal surface. It is therefore advantageous to optimize 
parameters in the trial wavefunction, which may improve 
the trial nodal surface. The simplest and most widely- 
used trial wavefunction is the Slater-Jastrow (SJ) form, 
which consists of a Slater determinant of single-particle 
orbitals multiplied by a Jastrow correlation factor. More 
advanced wavefunctions can be obtained by, for example, 
replacing the single determinant by a sum over configu- 
ration state functions (CSFs), by using backflow trans- 
formations of the electronic coordinates^—, or by using 
pairing wavefunctionaii~— . 

We present results of QMC calculations for the first 
row atoms (Li to Ne) obtained using trial wavefunctions 
of single and multi-determinant forms constructed from 



orbitals calculated on radial grids, both with and with- 
out backflow transformations. Our aims are to assess 
the different roles of multi-determinants and backflow 
in improving the trial wavefunctions for the first row 
atoms and to obtain benchmark VMC and DMC ener- 
gies for them. We find that the use of multiple CSFs 
and backflow transformations recovers a substantial ad- 
ditional fraction of the correlation energy at both the 
VMC and DMC levels. Our best wavefunctions recover 
99% or more of the correlation energy at the DMC level 
for Li, Be, B, C, N, and Ne, and more than 97% for 
O and 98% for F. This represents a substantial advance 
upon previous DMC calculations. 



II. WAVEFUNCTION FORM AND 
OPTIMIZATION 

Our multi-CSF SJ wavefunctions with backflow trans- 
formations can be written in the form 



*({rj,p) 



= e J({n},b) 



Ncsf 



^^({x,}), (1) 



where each = Xi({rj},c) results from the backflow 
transformation of the electronic coordinates {r^}, each 
$ n is a multideterminant representation of a CSF, and 
p = (a, b, c) are the variable wavefunction parameters: 
the CSF coefficients a, the Jastrow parameters b, and 
the backflow parameters c. It is worth noting that the 
Jastrow factor normally introduces spin-contamination 14 
and the backflow transformation angular momentum con- 
tamination into the trial wavefunction, so that "J will 
not normally be a spin/spatial angular momentum eigen- 
state. However, the VMC and DMC energies still provide 
upper bounds on the true ground state energies. 

The general form of our Jastrow factor J({ri},b) is 
described in an earlier paper—. In this study we used 
Jastrow factors containing electron-electron, electron- 
nucleus, and electron-electron-nucleus terms. Our back- 
flow transformations also consisted of electron-electron, 
electron- nucleus, and electron-electron-nucleus terms, as 
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described in Refi For both the Jastrow and backflow 
functions, the expansion orders and spin dependencies 
were chosen by increasing the variational freedom un- 
til no further lowering of the VMC energy was observed. 
The maximum number of variable parameters in the trial 
wavefunctions varied from 146 for Li to 252 for B, N, C, 
O, and F. The number of variable parameters is therefore 
only weakly dependent on the atomic number. 

We used the multiconfigurational Hartree-Fock 
(MCHF) atomic-structure package ATSP2K l §. to con- 
struct the functions appearing in Eq. (fTJ). This 
provides the lowest energy solution of the many-electron 
problem where the solution is limited to an 'active space' 
(AS) specified by an allowed set of subshell occupation 
numbers, or configurations. The wavefunction may 
formally be written as a sum of CSFs, 

AS 

\LS) as = J2 a »\» LS )> ( 2 ) 
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FIG. 1: (Color online) The percentage of the correlation en- 
ergy recovered for each atom as a function of the number of 
CSFs included in the MCHF wavefunction. Core excitations 
were included for Li but not for the other atoms. 



where v specifies a configuration (for example, ls 2 2s 2 2p 3 ) 
and seniority number—. Each CSF, \vLS), is defined by 
a radial orbital for each subshell, and by the total spa- 
tial/spin angular momentum quantum numbers, usually 
denoted 2S+l L. 

An additional freedom exists because the underlying 
symmetry of the Hamiltonian results in degenerate solu- 
tions corresponding to different projection angular mo- 
mentum eigenvalues, L z and S z . These additional quan- 
tum numbers are not required to specify the total en- 
ergy, the coefficients a„, and the mean field equations 
for the radial orbitals, and they are not provided by the 
ATSP2K code. As a result it is not straightforward to 
obtain an explicit form for the CSFs which is suitable for 
use in QMC calculations. For the calculations presented 
here an appropriate representation of each CSF, $„, was 
constructed as follows. 

First we note that two separate angular momentum 
eigenstates containing n\ and ni electrons may be cou- 
pled together to generate angular momentum eigen- 
states containing n\ + ri2 electrons^. Starting with sin- 
gle electrons and performing such a coupling iteratively 
for every allowed combination provides all of the total 
spin/angular momentum eigenstates possible for a given 
configuration as sums of determinants of spin-orbitals. 
Due care was taken to avoid redundant computation, to 
respect antisymmetry, and to employ the same coupling 
order/sign conventions^ as used in ATSP2K, allowing 
the efficient construction of each CSF as a sum of deter- 
minants of one-electron spin-orbitals. 

A different wavefunction results from each choice of L z 
and S z . The spin independence of the Hamiltonian allows 
a direct conversion of the sum of determinants of complex 
spin-orbitals into a sum of products of two determinants 
of real orbitals, one for spin up and one for spin down 
electrons. This procedure provides a representation of 



each CSF in the form 

^ U LS = ^2a j D]{vLS)Dj(yLS), ( 3 ) 
j 

where the determinants are of real single-particle orbitals, 
and such that ^ v a„<I>„LS has the same energy expec- 
tation value as the original MCHF wavefunction. Note 
that although $> v ls has total spatial angular momentum 
L and projected spin angular momentum S z , it is not 
necessarily an eigenstate of L z or S 2 . 

It should be clear that the <£>„ are not uniquely spec- 
ified by (v 2S+1 L) due to the spherical symmetry of the 
underlying Hamiltonian. In what follows we make the 
specific choice \L X \ = L, as this provides the smallest 
multideterminant expansions, and |>S Z | = S, as this pro- 
vides that same number of spin up/down electrons as 
predicted by Hund's first rule. 

To define the multideterminant part of Eq. (p} we take 
2S+1 L from the lowest energy HF state, and define an AS 
using the following rules for allowed excitations from the 
HF ground state configuration: only single and double 
(SD) electronic excitations were allowed, excitations of 
the Is core electrons were not allowed, and excitations 
were allowed into orbitals with quantum numbers n < 7 
and I < 5. A MCHF calculation was then performed for 
the chosen AS, and the resulting CSFs ordered by weight, 
a 2 . All but the Nqsf largest- weight configurations were 
then discarded, and a MCHF calculation was performed 
using this reduced AS(Ncsf) active space. The percent- 
ages of the correlation energies recovered at the MCHF 
level are shown in Fig. [T]for Nqsf < 100. 

Li is an exception: the above rules define an active 
space that is equivalent to HF because of the absence of 
Is excitations. In order to generate a nontrivial AS for Li 
we allowed excitations of all electrons into orbitals with 
quantum numbers n < 5 and I < 5. 
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Li ( 2 S) 


Be CS) 


B ( 2 P) 


C ( 3 P) 


N ( 4 S) 


( 3 P) 


F ( 2 P) 


Ne CS) 


VMC 


-7.47683(3) 


-14.6311(1) 


-24.6056(2) 


-37.8147(1) 


-54.5482(3) 


-75.0233(3) 


-99.6874(3) 


-128.9047(4) 


VMC-BF 


-7.47792(2) 


-14.6426(1) 


-24.6377(1) 


-37.8315(1) 


-54.5776(1) 


-75.0522(2) 


-99.7187(2) 


-128.9263(3) 


VMC-MD 


-7.47752(3) 


-14.66630(4) 


-24.65055(6) 


-37.8383(1) 


-54.5782(1) 


-75.0429(2) 


-99.7054(3 


-128.9206(3) 


VMC-MDBF 


-7.47799(1) 


-14.66716(2) 


-24.65254(4) 


-37.84130(8) 


-54.5840(1) 


-75.0566(2) 


-99.7220(2) 


-128.9246(4) 


DMC 


-7.47802(1) 


-14.65717(7) 


-24.63978(5) 


-37.8295(1) 


-54.5769(4) 


-75.0516(5) 


-99.7167(8) 


-128.9223(4) 


DMC-BF 


-7.478042(4) 


-14.65811(6) 


-24.64548(9) 


-37.83709(8) 


-54.5827(2) 


-75.0584(2) 


-99.7248(2) 


-128.9321(2) 


DMC-MD 


-7.47800(1) 


-14.66729(1) 


-24.65325(5) 


-37.84317(7) 


-54.5857(2) 


-75.0578(3) 


-99.7237(3) 


-128.9307(3) 


DMC-MDBF 


-7.478060(2) 


-14.667328(6) 


-24.65362(3) 


-37.84388(5) 


-54.5873(1) 


-75.0615(2) 


-99.7277(2) 


-128.9339(3) 


DMC2-MDBF 


-7.478058(2) 


-14.667319(6) 


-24.65357(3) 


-37.84385(4) 


-54.5873(1) 


-75.0617(2) 


-99.7274(2) 


-128.9339(3) 


Ehf 


-7.432727 


-14.573023 


-24.529061 


-37.688619 


-54.400934 


-74.809398 


-99.409349 


-128.547098 


E cxact 


-7.47806032" 


-14.66736 6 


-24.65391 6 


-37.8450 b 


-54.5892 6 


-75.0673 6 


-99.7339 6 


-128.9376 b 


-Ehf — Scxact 


0.04533332 


0.094337 


0.124849 


0.156381 


0.188266 


0.257902 


0.324551 


0.390502 


DMC2-corr% 


99.995(4)% 


99.957(6)% 


99.73(2)% 


99.26(3)% 


98.99(5)% 


97.83(8)% 


98.00(6)% 


99.05(8)% 



TABLE I: The VMC and DMC energies (in Hartrees) for each atom, using a single determinant SJ wavefunction, a single 
determinant SJ wavefunction with backflow (BF), multiple determinants (MD), and both together (MDBF). The DMC2-MDBF 
results are for a second DMC run with half the timestep of DMC-MDBF. Also shown are the Hartree-Fock single determinant 
energies (Ehf), the "exact" energies E oxac t, the correlation energies Ehf — E cxact , and the percentage of the correlation energy 
recovered by the DMC2-MDBF calculations (DMC2-corr%). The numbers in parentheses indicate the statistical uncertainty 
in the last digit shown. 

a taken from Ref.— (rounded to 9 significant figures). 
b taken from Ref&. 



We chose Ncsf = 20 for the multidctcrminant expan- 
sions used in the QMC calculations (corresponding to a 
number of determinants ranging from 83 for Li to 499 for 
Ne) . Although the application of this selection criterion 
to the first row atoms is necessarily somewhat arbitrary, 
it is not without justification. Figure Q] shows that for 
around 20 CSFs the fraction of correlation energy recov- 
ered within MCHF is roughly equivalent from Be to Ne. 
Including further orbitals in the initial 'large' AS, includ- 
ing triple excitations, or allowing core excitations, did not 
significantly change the MCHF energies for Ncsf < 20, 
with the exceptions of Li and Be for which core excita- 
tions improved the MCHF energies significantly for small 
Ncsf, but had a negligible impact on the DMC results. 

The variable parameters p were optimized by mini- 
mizing the VMC energy, using the scheme developed 
very recently by Umrigar et a/j 19 i 20 . This scheme is a 
generalisation of the method of Nightingale and Melik- 
Alaverdian 21 to include non-linear parameters in the trial 
wavefunction. In the past, variance minimization tech- 
niques were normally used because they are robust for op- 
timizing Jastrow factors, but in fact they have been found 
to behave poorly when optimizing linear coefficients. The 
method of Nightingale and Melik- Alaverdian^i is very ro- 
bust for optimizing linear coefficients, and Umrigar et 
a h 19 i 20 have developed an energy minimization scheme 
which is robust for both linear and non-linear parameters. 
All of the wavefunction parameters were optimized simul- 
taneously in our calculations. In some cases we obtained 
significantly lower VMC and DMC energies with energy 
minimization than with variance minimization when op- 
timizing the CSF coefficients a, and the performance for 



the Jastrow and backflow parameters (b and c) was also 
slightly improved. 



III. RESULTS 

We used the CASINO code 2 ^ for all of our VMC and 
DMC calculations. The wavefunction optimizations were 
performed using 10 5 statistically independent electronic 
configurations. The target population of walkers in the 
DMC calculations was about 2000 in each case, which 
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FIG. 2: (Color online) The variation of the DMC energy (in 
Hartrees) with timestep for the C atom, using a MDBF wave- 
function. The statistical error bars on the data are shown. 
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FIG. 3: (Color online) The percentage of the correlation en- 
ergy recovered for each atom within VMC, using a single 
determinant SJ wavefunction (VMC) with the addition of 
backfiow (VMC-BF), multiple determinants (VMC-MD), and 
both together (VMC-MDBF). The statistical error bars (not 
shown) are smaller than the symbols. 



should ensure that any population control bias is negligi- 
ble. The timesteps for the DMC calculations were chosen 
to make the systematic finite timestep errors smaller than 
the random statistical errors. The smallest timesteps 
used, corresponding to the DMC2 data in Table [U ranged 
from 0.00375 a.u. for Li to 0.0007 a.u. for Ne. The vari- 
ation of the DMC energy of the C atom with timestep is 
shown in Fig. [2 and the shape of this curve is typical of 
our calculations for the first row atoms. For C, the DMC 
acceptance ratios varied from 93(1)% for the largets time 
step (0.096 a.u.) to 99.8(1)% for the smallest time step 
(0.0015 a.u.). 

Table Q] gives the VMC and DMC energies, includ- 
ing the most accurate correlation energy percentages 
achieved for each atom. The "exact" non-relativistic en- 
ergies, assuming a point nucleus of infinite mass, were 
taken from Refsi 22 ' 23 . Fig. [3] shows the percentage of 
each atom's correlation energy recovered by the opti- 
mized wavefunctions within VMC as the multi-CSF ex- 
pansion and backfiow transformations were added. Fig.0] 
gives the same information at the DMC level. 

The general shapes of the curves in Figs. [3] and 0] are 
similar, but note that the VMC graph covers a range 
of 40%, while the DMC one covers only 12%. Our best 
VMC results surpass the accuracy previously achieved 
in DMC calculations using single determinant SJ trial 
wavefunctions 2 ^— . Our single determinant DMC results 
are similar to those reported by other authors^—. For 
Li and Be our best DMC energies are within error bars of 
some previous DMC calculations, but for the other atoms 
our results are clearly superior. Flad et alM performed 
multideterminant DMC calculations for the atoms B to 
F, but we have used larger expansions and our results 
are already superior at the DMC-MD level. Casula and 



Sorella 11 performed DMC calculations for the atoms Li- 
Ne with correlated geminal pairing wavefunctions, but 
our energies are substantially better than theirs. VMC 
and DMC calculations using backfiow wavefunctions have 
been reported for Li£ and Ne£. Liichow and Fink— per- 
formed a DMC calculation for N using a pair natural 
orbital configuration interaction trial wavefunction, re- 
trieving 98.2% of the correlation energy, which is very 
similar to our DMC-MD result, but our DMC-MDBF 
energy is lower. Our results for Li and Be are surpassed 
by Hylleraas variational calculation a 22 ' 30 . We should also 
note the existence of high quality non- variational results 
such as those from explicitly correlated r\2 Configuration 
Interaction and Coupled Cluster method o 31 ' 32 . 

Each of the VMC and DMC energies for Li corresponds 
to recovering well over 90% of the correlation energy. The 
single-determinant HF nodal surface of Li is extremely 
accurate, but incorporating backfiow improves the DMC 
energy noticeably. Backfiow gives a significant improve- 
ment for Be at the VMC level, but a very modest im- 
provement within DMC. The ground state wavefunction 
for Be contains a substantial admixture of the ls 2 2p 2 
excited state configuration which significantly alters the 
nodal surface. The single determinant ls 2 2s 2 HF nodal 
surface divides the configuration space into four nodal 
pockets, while the exact wavefunction has only two nodal 
pocket a 33 ' 34 . It does not appear possible to achieve an 
accurate DMC energy for Be with a trial wavefunction 
containing four nodal pockets. The continuous back- 
flow transformations we use are incapable of changing the 
number of nodal pockets^ and consequently the single de- 
terminant DMC energy for Be is very poor. When an ap- 
propriate amount of the ls 2 2p 2 configuration is included 
in the trial wavefunction, the DMC energy improves dra- 
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FIG. 4: (Color online) The percentage of the correlation en- 
ergy recovered for each atom within DMC, using a single 
determinant SJ wavefunction (DMC) with the addition of 
backfiow (DMC-BF), multiple determinants (DMC-MD), and 
both together (DMC-MDBF). The statistical error bars (not 
shown) are smaller than the symbols. 



5 



matically, and backflow contributes a small additional 
improvement. 

The percentages of the correlation energy recovered for 
N at the VMC level from the single determinant backflow 
and 20 CSF wavefunctions are almost the same, while for 
higher atomic numbers backflow gives more correlation 
energy than the CSFs. Fig.[TJshows that the percentages 
of the correlation energy recovered within the MCHF 
calculations for O and N increase relatively slowly with 
the number of CSFs, which may be connected with the 
lower percentages of correlation energy recovered in the 
MD and MDBF calculations for these atoms. Note that, 
apart from Li and Be where the DMC-MD energies with 
20 CSFs are already excellent, backflow adds substan- 
tially to the correlation energy recovered. It seems that 
backflow and multi-CSFs improve the single determinant 
wavefunctions in rather different ways, and therefore it 
can be very advantageous to combine them. 

One might expect that the VMC-MDBF energies 
would always be lower than the VMC-BF and VMC-MD 
energies, as the wavefunction parameters are obtained 
by minimizing the VMC energy. Table U shows, however, 
that this does not hold for Ne. The most likely reason for 
this is that the HF and MCHF orbitals are different, so 
that the single determinant wavefunction does not cor- 
respond to a particular choice of the MD wavefunction 
parameter values. This also indicates that further reduc- 
tions in the VMC energy could be obtained by optimizing 
the orbitals along with the Jastrow and backflow param- 
eters and the CSF coefficients. 



IV. DISCUSSION 

As can be seen from Fig. 21 we recover 99% or more 
of the correlation energy for all the atoms except O and 
F. The different freedoms introduced by the multi-CSF 
expansion and backflow transformations are both vital in 
achieving the best energies, indicating that they capture 
different aspects of the wavefunction correlation. Back- 
flow appears to describe parts of the dynamic correlation 
beyond the SJ wavefunction while the low-energy CSFs 
beyond the HF ground state configuration describe static 
correlation energy. Our results contribute to the growing 
evidence^ that adding backflow transformations to SJ 
wavefunctions can significantly improve the nodal surface 
for systems from atoms to solids. 

It should be noted that, while it might be difficult 
to substantially improve the Jastrow factors and back- 
flow transformations, we could easily include many more 
CSFs in the trial wavefunctions without a prohibitive in- 
crease in computational expense. The orbitals in the 
CSFs could also be optimized within VMC. The ener- 
gies reported here do not therefore represent the current 



practical limits of the VMC and DMC methods for all- 
electron atoms. 

For general systems, the number of CSFs required 
to obtain a significant improvement in the energy in- 
creases rapidly with the number of electrons N. The 
number of required parameters in the backflow functions 
only increases (roughly linearly) with the number of in- 
equivalent aoms. The cost of moving an electron in a 
VMC/DMC calculation scales approximately as V 2 . In 
a VMC-BF /DMC-BF calculation the cost scales as N 3 
because all of the orbitals must be evaluated at the po- 
sitions {x^} when an electron is moved. The cost of a 
VMC-MD /DMC-MD calculation scales as N 2 M, where 
M is the number of determinants, and the cost of a 
VMC-MDBF /DMC-MDBF calculation scales as N 3 M. 
The number of moves required to obtain a fixed statis- 
tical error bar in the energy is smaller for a more ac- 
curate trial wavefunction, so that the pre-factors in the 
cost for the more sophisticated wavefunction forms are 
smaller. As an example, for the C atom the relative 
computational costs to achieve the same error bar are 
1.0/3.6/6.2/17.8 for the VMC SD/SDBF/MD/MDBF 
calculations, while in DMC the corresponding cost ra- 
tios are 9.0/13.7/15.1/31.9. Overall, we expect that we 
can maintain the level of accuracy achieved here for small 
molecules. 

We obtained superior results with the energy mini- 
mization scheme of Umrigar et al. 1 ^ than with variance 
minimization procedures^ - — . The energy minimization 
method is particularly advantageous when multiple de- 
terminants are used because it is well-suited to optimiz- 
ing linear parameters, whereas variance minimization en- 
counters some difficulties in optimizing CSF coefficients. 
The combination of energy minimization and the MD and 
BF wavefunction forms has led to substantial reductions 
in the VMC energies, which are very important because 
the variance of the DMC energy has been found to be pro- 
portional to the error in the VMC energ y 38 i 39 . Although 
reductions in the DMC energies from the improved trial 
wavefunctions are smaller than in VMC, they represent 
substantial fractions of the remaining correlation energy. 
Our best DMC energies for Li and Be are within error 
bars of previously published DMC energies, but our re- 
sults for the higher atomic number atoms are markedly 
superior. 
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